Back

Horticulture Research

Oxford University Press (OUP)

Preprints posted in the last 90 days, ranked by how well they match Horticulture Research's content profile, based on 43 papers previously published here. The average preprint has a 0.09% match score for this journal, so anything above that is already an above-average fit.

1
What makes a banana false? How the genome of Ethiopian orphan staple Ensete ventricosum differs from the banana A and B sub-genomes

Muzemil, S.; Paul, P.; Baxter, L.; Dominguez-Ferreras, A.; Sahu, S. K.; Van Deynze, A.; Mai, G.; Yemataw, Z.; Tesfaye, K.; Ntoukakis, V.; Studholme, D. J.; Grant, M.

2026-02-23 bioinformatics 10.64898/2026.02.21.706659 medRxiv
Top 0.1%
53.1%
Show abstract

BackgroundEnsete ventricosum, also known as the "tree against hunger" plays a key role in Ethiopian food security and farming systems, feeding more than 20 million people. Since domestication via clonal selection in the south-west Ethiopian highlands, todays diverse enset landraces contribute multiple benefits including food, fibre by-product, animal bedding and cattle fodder to farmers and local communities. Improved genomic resources for this highly drought-tolerant plant are essential to supplement the conventional clonal selection-based breeding programme and pave the way towards targeted breeding. ResultsWe sequenced the genome of enset landrace Mazia, which is partially resistant/tolerant to Xanthomonas wilt and predicted 38,940 protein-coding genes. The Mazia assembly (540.14 Mb) is more complete than the previously published genome assembly of landrace Bedadeti (451.28 Mb) and displayed 1.41% heterozygosity and 64.64% repetitive DNA content. Comparative analyses with the Bedadeti assembly and chromosome-level genome sequences of the two main banana progenitors (Musa acuminata, AA genome; Musa balbisiana, BB genome) unexpectedly revealed [~]25% of the Mazia genome is unique to enset. Gene Ontology (GO) and sequence similarity search analysis of enset-specific protein-coding genes identified distinct functional signatures that underpin the lifestyle, adaptation, and corm productive quality of enset, including functions related to DNA integration, carbohydrate metabolism, disease resistance and transcriptional regulation. In contrast, Musa-specific genes showed enrichment for defence response, protein phosphorylation and fruit development pathways. Focusing on the classical nucleotide binding site leucine rich repeat (NLR) disease resistance genes, we identified and characterised NLRs in enset and Musa species genomes, revealing a considerable expansion in the Musa acuminata genome. We also identified unique genes in enset and banana genomes whose functional and evolutionary roles are yet to be determined. ConclusionsHere, we report a de novo genome assembly for the enset (Ensete ventricosum) landrace Mazia and provide a high-quality annotation of both Mazia and the previously published assembly of the landrace Bedadeti. Collectively, these genomic resources provide a valuable foundation for comparative genomics within the Musaceae family and open new opportunities for the development of marker-assisted breeding strategies to accelerate the improvement of agronomically important traits in enset.

2
Haplotype specific analyses in the phased genomes era: the case of apple cv "Golden Delicious"

Bianco, L.; Busatto, N.; Moser, M.; Micheletti, D.; Spina, L.; Troggio, M.; Piazza, S.; Costa, F.; Fontana, P.

2026-02-02 genomics 10.64898/2026.01.28.702214 medRxiv
Top 0.1%
51.8%
Show abstract

Apple is one of the major cultivated fruit crops in temperate regions. To better support breeding programs and facilitate the development of improved cultivars, we generated a new haplotype-resolved version of the Golden Delicious genome, one of the major founders of many modern apple lineages. The assembly features the separation of the two haplotypes, with a total size of 647.3 Mb and 649.2 Mb, respectively. The phasing was accurately validated with 10,321 curated SNPs. Telomere-to-telomere continuity was verified by the analysis of telomeric sequence composition at the end of each chromosome. Gene prediction identified a total of 45,116 genes in haplotype 1 and 45,063 genes in haplotype 2. A pangenome analysis employing 6 haplotype resolved genomes identified both common and unique gene families. The availability of a phased genome enabled the assessment of genome-wide allelic specific expression. Our case study, focusing on Md-PG1 (a key regulator of fruit softening), revealed that the allelic form on haplotype 2 (GDH2-10g24673) was the dominant contributor to total gene expression. In addition, the phased genome also showed specific miRNA chromosomal distribution patterns, as well as a distinct methylation profile. Altogether, these genomic resources provide new insights into the allelic regulation of key agronomic traits and represent a valuable tool to accelerate apple breeding.

3
Genome-wide characterization of extant clonal diversity in Chilean Carmenere

Garcia, J.; Cochetel, N.; Balic, J.; Barros, S.; Figueroa-Balderas, R.; Castro, A.; Cantu, D.

2026-04-07 genomics 10.64898/2026.04.03.716224 medRxiv
Top 0.1%
40.5%
Show abstract

Carmenere is a widely cultivated and internationally recognized grapevine cultivar in Chile, yet genetic variation among its clones remains poorly characterized. Early studies based on SSR and AFLP markers detected limited polymorphism, but these approaches interrogate only a small fraction of the genome, leaving the extent of clonal diversity unresolved. Here, we generated an improved chromosome-scale diploid genome assembly of Carmenere FPS clone 02 and characterized clonal genomic diversity by sequencing 36 biological replicates representing 12 clones maintained in Chile, including heritage selections rescued from old producer vineyards by Vina Santa Carolina as part of its Bloque Herencia conservation program, and commercial nursery-derived clones. Focusing on low-frequency variants and using replicate-aware consensus calling, we identified more than 9,000 private single nucleotide variants (SNVs) and small indels per clone, providing high-resolution markers for clonal identification. Although most variants were located in repetitive or intergenic regions, a subset affected coding sequences, with genes involved in plant-pathogen interactions, transport, and secondary metabolism most frequently impacted. While variant-affected genes associated with wine anthocyanin content, TA, pH, and alcohol percentage were identified, broader phenotypic characterization will be required to assess their biological significance. Overall, this study provides a genome-wide characterization of extant clonal diversity in Carmenere, with implications for clonal selection and genetic resource conservation.

4
Exploring the genetic architecture underlying dietary fiber content in Colombian Andean blueberry (Vaccinium meridionale Swartz)

Anacona, G. P. V.; Correa, A. C. G.; Narvaez Cuenca, C. E.; Vasquez, T. M.; Soto Sedano, J. C.

2026-02-20 genetics 10.64898/2026.02.19.706934 medRxiv
Top 0.1%
37.4%
Show abstract

Dietary fiber composition is a major determinant of fruit nutritional quality, yet its genetic basis remains poorly characterized in wild Vaccinium species. Here, we combined extensive phenotyping with a genome-wide association study (GWAS) to dissect the genetic control of dietary fiber traits in Colombian agraz (Vaccinium meridionale Swartz). Total dietary fiber (TDF), insoluble dietary fiber (IDF), and soluble dietary fiber (SDF), the SDF/IDF ratio, and maturity index (MI) were quantified in fruits from 119 genotypes, representing the most comprehensive evaluation of dietary fiber fractions in fresh Vaccinium fruit to date. GWAS mapped this phenotypic diversity to 24 QTLs distributed across 15 chromosomes, revealing a polygenic architecture underlying fiber-related trait. A TDF QTL (Chr41:26883013) directly co-localized with VaccDscaff31-augustus-gene-268.33, a 7-deoxyloganetin glucosyltransferase, embedded within a glycosyltransferase-rich LD block. IDF variation was associated with VaccDscaff33-processed-gene-116.2 (pectin methylesterase 15) while the SDF/IDF ratio co-localized with VaccDscaff55-augustus-gene-9.30, encoding a xyloglucan endotransglucosylase/hydrolase. Together, the integration of high-resolution phenotyping with QTL mapping connects natural variation in dietary fiber content and composition to specific biosynthetic, remodeling, and regulatory pathways, providing actionable molecular targets for marker-assisted and genomic selection aimed at improving nutritional quality, texture, and processing traits in Vaccinium breeding programs.

5
Pangenome Analysis of Pyrus Reveals Potential Role of Transcription Factors in Recent Adaptation to Arid Environments

Labbancz, J.; Dhingra, A.

2026-01-22 genomics 10.64898/2026.01.20.700718 medRxiv
Top 0.1%
31.6%
Show abstract

Pears (genus Pyrus) are among the most extensively cultivated tree fruits with a wide-reaching economic impact. Despite this, the genetic basis of most pear traits of interest, including abiotic stress tolerance, tree architecture, precocity, parthenocarpy, disease resistance, and fruit ripening, remains poorly understood. Although extensive efforts have been made to identify quantitative trait loci (QTLs) that explain the genetic basis of pear traits, many are poorly transferable, limiting their utility for informing genetic improvement or management of pears across most genetic backgrounds. To provide a whole-genome context and enable the exploration of functional variation in Pyrus, we developed a pangenome graph using 31 accessions representing 23 Pyrus species from the National Clonal Germplasm Repository. Whole-genome sequencing was performed solely with Oxford Nanopore, generating highly contiguous assemblies for pangenome construction, demonstrating the viability of a single-platform approach to pangenomic analysis in Viridiplantae. Exploration of the pangenome graph reveals genes present in some lineages, with potential functional implications. A group of arid-adapted Pyrus species exhibits signs of selective sweeps in regions associated with transcription factors, likely impacting abiotic stress tolerance. With the development of this pangenomic resource, resequencing analysis in Pyrus is now possible without the limitations imposed by single-reference genome assemblies.

6
iGS: A Zero-Code Dual-Engine Graphical Software for Polygenic Trait Prediction

Zhang, J.; Chen, F.

2026-03-03 bioinformatics 10.64898/2026.02.28.708730 medRxiv
Top 0.1%
28.1%
Show abstract

Genomic selection (GS) has become the core driving force in modern plant and animal breeding. However, state-of-the-art comprehensive GS tools often rely on complex underlying environment configurations and command-line operations, posing significant technical barriers for breeders lacking programming expertise. To address this critical pain point, this study developed a fully "zero-code" graphical user interface (GUI) decision support system for genomic selection. The platform innovatively employs a "portable dual-engine architecture" (R-Portable and Python-Portable) to achieve completely dependency-free, "out-of-the-box" deployment, and integrates a standardized six-step end-to-end workflow from data quality control to result export. Furthermore, the platform comprehensively integrates 33 cutting-edge prediction models across four major paradigms, linear, Bayesian, machine learning, and deep learning, and features an original intelligent parameter configuration system that dynamically renders algorithm parameters to provide a minimalist UI interaction experience. Benchmark testing on the Wheat2000 dataset across six complex agronomic and quality traits, including thousand-kernel weight (TKW) and grain protein content (PROT), demonstrated that classic linear models remain highly robust for polygenic additive traits, while tree-based machine learning and hybrid deep learning architectures exhibit superior predictive potential and noise resilience when resolving complex epistatic effects and low-heritability traits. The successful deployment of this platform fundamentally liberates biologists from the constraints of computational science, providing robust digital infrastructure to accelerate the popularization and practical application of GS technologies in agricultural production.

7
The control of prickle formation in Rubus

St. Aubin, B.; Poorten, T.; Fister, A.; Ochsenfeld, C.; Reiner, J.; Castillo, A. S.; Aryal, R.; Bruna, T.; Dudchenko, O.; Sargent, D.; Mead, D.; Buti, M.; Silva, A.; Pham, M.; Weisz, D.; Bassil, N.; Ashrafi, H.; Aiden, E. L.; Graham, N.; Chauhan, D.; Dean, E.; Lowry, W.; Redpath, L.; Marri, P.; Lawit, S.; Pham, G.; Worthington, M.; Crawford, B. C.

2026-03-12 plant biology 10.64898/2025.12.22.695586 medRxiv
Top 0.1%
27.5%
Show abstract

Prickles on blackberry and raspberry canes make pruning, harvesting, and handling more difficult and can increase labor costs for growers. The trait has been challenging to improve in these clonal crops because it is recessive and linked to undesirable agronomic traits. In blackberry and red raspberry, breeding programs have used recessive mutants at the S locus to generate prickleless cultivars for the last century. In this study, we identified independent loss-of-function mutations in a WUSCHEL-LIKE HOMEOBOX transcription factor, WOX1, as the genetic basis of the prickleless S locus in both blackberry and red raspberry. We mapped the S locus using integrated genome-wide association, bulked segregant analysis, and identity-by-descent analyses informed by breeding pedigrees. Additionally, we generated a genome sequence from Luther Burbanks prickleless blackberry variety Burbank Thornless that contained an additional allele of WOX1. To verify the genes role, we used gene editing to knock out WOX1 in an elite prickled commercial blackberry line. All edited plants were prickleless and lacked glandular trichomes, confirming that WOX1 controls a joint developmental pathway. Other plant traits were unchanged, indicating WOX1 is a specific and safe target for improvement. Gene editing can enable breeders to remove prickles directly from elite varieties, reducing the need for extensive breeding cycles and delivering safer, easier-to-harvest cultivars to growers.

8
A full-length GRAS domain protein underpins efficient shoot regeneration in pepper

Park, S.-J.; Yang, J. H.; Kim, H.-S.; Lee, H.-J.

2026-04-01 plant biology 10.64898/2026.04.01.715768 medRxiv
Top 0.1%
25.7%
Show abstract

Pepper (Capsicum annuum L.) is a recalcitrant species regarding shoot regeneration, a trait that serves as a major bottleneck for the application of genetic engineering tools. In this study, comparative genetic analysis between a rare high-regeneration cultivar and a common low-efficiency cultivar identified a single nucleotide polymorphism (SNP) in PHYTOCHROME A SIGNAL TRANSDUCTION 1 (CaPAT1) that determines shoot regeneration efficiency. The T478C SNP in the high-efficiency cultivar converts a stop codon into an Arg codon, leading to translational read-through into the neighboring gene and forming an intact GRAS domain. This SNP-mediated formation of full-length CaPAT1 is essential for its dimerization. Notably, the overexpression of CaPAT1T478C in multiple low-efficiency cultivars, including both hot and bell peppers, significantly improved both shoot regeneration and transformation efficiency in the transformed T0 generation. These findings demonstrate the pivotal role of CaPAT1 in enhancing shoot regeneration and provide a robust strategy to overcome recalcitrance in pepper.

9
Identification and Functional Characterization of Isoflavone Synthase Gene Family in Pea (Pisum sativum): The Entry Point to Pisatin Biosynthesis

Tahir, M. S.; Kuflu, K.; Islam, N. S.; Mcdowell, T.; Dhaubhadel, S.

2026-01-22 biochemistry 10.64898/2025.12.22.696109 medRxiv
Top 0.1%
22.3%
Show abstract

Isoflavone synthase (IFS), a cytochrome P450 monooxygenase of the CYP93C subfamily, catalyzes the conversion of flavanones into isoflavones, the first committed step in the biosynthesis of isoflavonoid phytoalexins. In pea (Pisum sativum L.), the phytoalexin pisatin plays a pivotal role in defense against pathogens. However, the molecular basis underlying IFS function in pea remains poorly understood. In this study, we performed a comprehensive genome-wide identification and characterization of IFS genes in pea. Three IFS candidates, PsIFS7A, PsIFS7B, and PsIFS7C, were identified that reside on chromosome 7, each harboring all conserved cytochrome P450 signature motifs. PsIFS genes exhibited predominant expression in root tissue, with transcript levels induced rapidly upon Aphanomyces euteiches infection. Enzymatic assays confirmed their catalytic activity in converting the flavanones naringenin and liquiritigenin into the isoflavones genistein and daidzein, respectively, both in vitro and in planta systems. Furthermore, all three PsIFS genes were found in close proximity to quantitative trait loci (QTL) associated with Aphanomyces root rot resistance. Together, these findings provide novel insights into the IFS gene family in pea and lay a foundation for metabolic engineering or molecular breeding strategies to enhance disease resistance through targeted modulation of pisatin biosynthesis.

10
T2T Genome and Population Resequencing Reveal OfCCD4 Alleles Orchestrate Petal Color and Scent in Osmanthus fragrans

Liu, S.; You, S.; Yuan, J.; Zeng, X.; Yang, Q.; Xu, S.; Xi, W.; Peng, Z.; Zhu, L.; Zhong, L.; Tan, Y.-f.; Zheng, R.-R.

2026-02-17 plant biology 10.64898/2026.02.14.705464 medRxiv
Top 0.1%
21.9%
Show abstract

Osmanthus fragrans is prized for its floral color and fragrance, both key targets for genetic improvement. However, the lack of a complete genome assembly and comprehensive population structure hinders gene dissection and marker development for breeding. To investigate the genetic basis of color-scent interaction, a telomere-to-telomere (T2T) genome assembly is generated, and whole-genome resequencing of 100 cultivars is performed. Integrative population and metabolomic analyses reveal a trade-off: orange-red ( Aurantiacus) cultivars accumulate high /{beta}-carotene but low aromatic apocarotenoids (/{beta}-ionone), while yellow-white cultivars show the opposite pattern. Divergence mapping identifies OfCCD4 as the major underlying locus. Three alleles are characterized--functional (A), partially functional (aDel), and a frameshift null (aStop)--with aStop strictly co-segregating with the Aurantiacus phenotype. Transient and stable transformations confirm that A and aDel cleave /{beta}-carotene into /{beta}-ionone, while aStop abolishes this activity. A co-dominant PCR marker is developed based on allele-specific polymorphisms. OfCCD4 is thus established as the key regulator of the color-scent trade-off in osmanthus. The identified alleles and molecular marker enable rapid, low-cost genotyping at the seedling stage--offering particular value for marker-assisted selection in ornamental breeding, where extended juvenility makes phenotypic selection highly time- and resource-intensive.

11
Next-Generation Soybean Haplotype Map as A Genomic Resource for Enhanced Trait Discovery and Functional Analysis

Khan, A. W.; Doddamani, D.; Song, Q.; Vuong, T. D.; Chhapekar, S. S.; Ye, H.; Garg, V.; Varshney, R. K.; Nguyen, H. T.

2026-03-26 genomics 10.64898/2026.03.24.713798 medRxiv
Top 0.1%
21.8%
Show abstract

We present a global soybean haplotype map generated from whole-genome sequencing of 1,278 Glycine max and Glycine soja accessions, comprising 11.37 million SNPs and 2.05 million short insertions and deletions. This map (GmHapMap-II) captures unprecedented worldwide genetic diversity, reflecting the broad extent of the global soybean gene pool. Population structure analyses revealed six geographically distinct subpopulations that affected the linkage and shaped the recombination. The haplotype variation map was used to identify novel genomic regions associated with crude protein content on chromosome 15 that were not detected by a lower SNP density array. LD-based haplotype analysis revealed a superior haplotype for crude protein content. The constructed haplotype map enabled detailed characterization of haplotype diversity and copy number polymorphism at the SCN-associated rhg-1 and Rhg-4 loci, revealing both novel haplotype structures and germplasm lines with elevated CNV relative to previously characterized genotypes. We employed the HapMap matrix for a multi-class variations ML-based genomic prediction approach to predict phenotypes for SCN and catalogued the gene-centric haplotypes in a user-friendly database. The analysis revealed the extent of deleterious alleles present in the soybean germplasm and how breeders have deployed beneficial alleles and purged deleterious ones. The haplotype map will serve as a major genomic resource for trait-based mapping, enhancing efforts in the genomics-enabled development of improved cultivars.

12
Unlocking the potential of Capsicum Germplasm Collections for Climate Resilience and Fruit Quality

Halpin-McCormick, A.; Nalla, M. K.; Radlicz, Z.; Zhang, A.; Fumia, N.; Lin, T.-h.; Lin, S.-w.; Wang, Y.-w.; Zohoungbogbo, H. P. F.; Wang, D. R.; Runck, B.; Gore, M. A.; Kantar, M. B.; Barchenger, D. W.

2026-03-28 plant biology 10.64898/2026.03.25.714358 medRxiv
Top 0.1%
18.7%
Show abstract

Climate change increasingly threatens global Capsicum (pepper) production. Accelerating the deployment of climate-resilient cultivars requires effective use of genetic diversity conserved in genebanks. We implement a "turbocharging" strategy in Capsicum by integrating genome-wide association studies and genomic prediction in a core collection (n = 423), followed by genomic prediction across the global collection (n = 10,250) using the core as a training population. We generated genomic estimated breeding values (GEBVs) for 31 high-accuracy traits (r > 0.5) encompassing hyperspectral phenotypes (heat/control), agronomic performance (heat/control) and fruit quality. To enhance accessibility and decision-making, we developed a large language model (LLM) integrated application that enables flexible, preference-based selection of candidates. By narrowing the parental decision space, this framework streamlines screening of large germplasm collections while balancing climate resilience, quality attributes and market demands. Our approach provides a scalable decision-support system to accelerate climate-resilient Capsicum breeding and maximize global genetic resources.

13
Development of KASP Molecular Markers and Construction of Fingerprinting for Cowpea (Vigna unguiculata (L.) Walp) unguiculata Based on ddRAD-Seq

Xiang, J.; Zhong, Z.; Zhang, C.; He, M.; Cai, K.; Gu, L.; Xu, L.; Su, S.; Zou, Y.; Li, J.; Cui, K.; Qiu, H.; Xian, B.; Fu, S.; Chen, L.; liu, x.

2026-02-13 plant biology 10.64898/2026.02.12.705477 medRxiv
Top 0.1%
18.3%
Show abstract

Cowpea (Vigna unguiculata (L.) Walp.) is a globally important legume crop. However, the scarcity of efficient molecular markers has hindered molecular breeding efforts and the protection of plant breeders rights. In this study, we employed double-digest restriction-site associated DNA sequencing (ddRAD-seq) to characterize the genetic diversity of 19 core cowpea germplasm accessions. We generated 791,621 SNPs, from which 13,469 high-quality SNPs were filtered. Population structure and phylogenetic analyses revealed that these accessions cluster into three distinct groups. To facilitate cost-effective and rapid genotyping, we developed a panel of KASP (Kompetitive Allele Specific PCR) markers. Through rigorous screening for polymorphism and stability, we identified six core KASP markers located in exonic regions. These six markers alone were sufficient to discriminate all 19 accessions. Based on these core markers, we constructed a unique DNA fingerprinting profile and assigned specific QR codes for each accession. This study demonstrates that extracting core KASP markers from ddRAD-seq data is a powerful strategy for germplasm identification. The developed fingerprinting system provides a robust, low-cost tool for seed purity testing, variety authentication, and marker-assisted selection in cowpea breeding programs.

14
SABER: A Multiparental Tomato Population Leveraging Wild Relative Diversity for High-Resolution QTL Mapping

Gabelli, G.; Caproni, L.; Palumbo, F.; Boni, A. G.; Ferrari, G.; Prazzoli, L.; Malatrasi, M.; Sestili, S.; Dell'Acqua, M.; Beretta, M.; Barcaccia, G.

2026-03-20 plant biology 10.64898/2026.03.18.712672 medRxiv
Top 0.1%
18.3%
Show abstract

The narrow genetic base of cultivated tomato (Solanum lycopersicum L.) represents a major constraint on crop improvement, necessitating the exploitation of wild relatives to broaden allelic diversity. Here we present SABER (Solanum lycopersicum Allele Biodiversity Enriched Resources), a novel eight-founder Multiparent Advanced Generation Intercross (MAGIC) population that, for the first time, incorporates the Galapagos wild relative Solanum cheesmaniae as a founder alongside seven elite S. lycopersicum lines. Following a structured crossing scheme and Single Seed Descent advancement, F6 recombinant inbred lines were genotyped at 5,850 high-confidence SNP markers using Single Primer Enrichment Technology (SPET). Population structure analyses confirmed low residual heterozygosity, limited substructure among offspring, and successful introgression of S. cheesmaniae alleles across all twelve chromosomes. Mapping performance was validated through three Mendelian traits with known genetic determinants, all of which resolved to genomic positions consistent with the literature. QTL mapping for quantitative agronomic traits identified known loci for fruit epicarp and flesh color, and two novel QTL for days to flowering, number of leaves before flowering, and soluble solids content. Together, these results demonstrate that SABER is a powerful and reliable platform for high-resolution QTL mapping and candidate gene discovery, and establish a replicable framework for integrating wild germplasm into multiparental tomato breeding resources

15
Optimization of Agrobacterium-mediated transformation of commercial heirloom tomato cultivars to develop novel traits via CRISPR/Cas9 Genome Editing

Oxendine, J.; Ibarra-Reyes, E.; Ma, J.; Li, C.; Baron, S.; Hwang, A. E.; Wang, R.; Rodriguez-Leal, D.

2026-02-10 plant biology 10.64898/2026.02.07.703897 medRxiv
Top 0.1%
17.9%
Show abstract

Genetic improvement using new genome editing approaches rely on the efficient delivery of the CRISPR/Cas system in the vegetable crop tomato. Previous protocols for tomato transformation have primarily focused on a handful of cultivars (M82, Alisa Craig, Microtom, Sweet-100) with very little commercial relevance, and it is not clear if these protocols can be implemented directly in other commercially relevant varieties. Heirloom tomatoes are sought for their deep and diverse flavor but have not been subjected to systematic crop improvement via conventional breeding or biotechnology approaches such as transgenesis or genome editing. Therefore, we tested the transformation and regeneration capacity of six different heirloom cultivars known for their superior taste and market relevance in the US. Subsequently, we optimized rooting conditions and used the GRF4-GIF1 chimeric developmental regulator to successfully recover transgenic plants. Finally, we evaluated the efficiency of targeted genetic modification using the CRISPR/Cas9 genome editing system in several of these cultivars. We demonstrate that our optimizations led to successful transformation of several heirloom varieties, including the generation of edited plants for target genes modifying plant architecture and flowering time. Our results set the foundation for a biotechnology platform to deliver improved traits to local and regional heirloom varieties using genome editing.

16
Strigolactone signaling regulates corm development through SPL15-mediated hormonal crosstalk in banana

Long, F.; Zhao, M.; Wu, P.; Zhou, Y.; Huang, X.; Mo, T.; Hu, X.

2026-03-16 bioinformatics 10.64898/2026.03.12.711407 medRxiv
Top 0.1%
17.6%
Show abstract

Strigolactones (SLs) are an important class of plant hormones that play crucial roles in regulating plant branching, root architecture, and organ development. However, the regulatory mechanisms underlying the crosstalk between SLs and other plant hormones remain largely unclear, particularly regarding the key regulatory genes that integrate and coordinate multiple hormonal signaling pathways. In this study, secondary cup seedlings of the Pisang Awak banana cultivar Yufen 6 at the eight-leaf stage were used as experimental materials. The roots were treated with a nutrient solution containing 30 mol/L exogenous SLs, while a nutrient solution supplemented with water served as the control. Tissues near the corm growth point were collected at 0, 15, 30, 60, 90, and 120 days after treatment to measure corm weight, height, and diameter, and transcriptome sequencing was performed using the collected tissues. Differentially expressed genes (DEGs) at different treatment stages were identified, followed by Gene Ontology (GO) annotation and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses to systematically investigate the crosstalk between SLs and endogenous hormone metabolism and signaling during corm development in Pisang Awak banana. The results showed that SL treatment significantly inhibited the weight, height, and diameter of the corm. The regulatory effect of SLs on Pisang Awak banana corm development exhibited a clear temporal dynamic pattern, representing a gradual accumulation process that ultimately triggers key developmental transitions. The highest number of DEGs was detected at 15 days after treatment, including 3943 upregulated genes and 3704 downregulated genes, indicating that this stage represents a critical phase for SL response initiation. GO enrichment analysis revealed that the DEGs were mainly involved in metabolic processes, biological regulation, response to stimulus, and regulation of biological processes. KEGG pathway analysis indicated that these DEGs were significantly enriched in pathways related to plant hormone signal transduction, starch and sucrose metabolism, and secondary metabolite biosynthesis. Further analysis revealed that the crosstalk between SLs and multiple hormone metabolic and signaling pathways is mediated by the SPL15 gene, involving auxin (IAA), cytokinin (CTK), abscisic acid (ABA), brassinosteroids (BRs), gibberellins (GA), and jasmonic acid (JA) pathways. This study reveals the molecular mechanism by which SLs regulate Pisang Awak banana corm development through SPL15-mediated integration of multiple hormonal signals, providing new insights into the role of SLs in regulating the development of underground organs in banana.

17
Near-complete, haplotype-resolved genome assembly of common buckwheat (Fagopyrum esculentum Moench)

Hess, F.; Chen, Y.; Lopez Ortiz, M. E.; Colliquet, A.; Stoffel-Studer, I.; Mac, V.; Grob, S.; Koelliker, R.; Studer, B.

2026-04-01 genomics 10.64898/2026.03.30.715208 medRxiv
Top 0.1%
16.9%
Show abstract

Common buckwheat (Fagopyrum esculentum Moench) is a globally cultivated pseudocereal with a high nutritional quality and economic value. Due to its self-incompatibility, common buckwheat exhibits a high level of heterozygosity, making genome assembly challenging. Consequently, reference-level haplotype-resolved assemblies of common buckwheat are scarce, hindering research and genomics-assisted breeding. Here, we present a near-complete, chromosome-level, haplotype-resolved assembly of a common buckwheat F1 genotype (named Tuka), generated using a trio-binning approach that integrated parental Illumina short-read data with PacBio HiFi and Hi-C data from Tuka. The Tuka assembly comprises two haplomes, Tuka_h1 and Tuka_h2, both showing high contiguity (contig N50 of 76.68 Mb and 84.57 Mb, respectively), high completeness (assembly sizes of 1.28 Gb and 1.23 Gb with BUSCO scores of 96.9% and 96.8%, respectively), high base-level accuracy (QV of 59.08 and 63.03, respectively), and few gaps (35 and 30, respectively). This near-complete assembly of Tuka serves as a valuable genomic resource for common buckwheat, enabling advanced genomic analyses and accelerating research and breeding using state-of-the-art genomic tools.

18
Multidimensional analysis of drought response in an inter-specific tomato population (ToMAGIC)

Antar, O.; Rivera, A.; Fenero, D.; Serrano, L.; Alache, K.; Kabas, A.; Bancic, J.; Plazas, M.; Gramazio, P.; Prohens, J.; Vilanova, S.; Casals, J.

2026-02-19 genomics 10.64898/2026.02.18.706544 medRxiv
Top 0.1%
14.7%
Show abstract

Drought stress poses a significant threat to agricultural productivity, particularly in regions with limited water availability. This study delves into the drought response in a multiparental interspecific tomato MAGIC population (ToMAGIC), developed by intercrossing Solanum pimpinellifolium (SP) and S. lycopersicum var. cerasiforme (SLC). A core collection of 139 recombinant lines, selected for their genetic diversity, was evaluated under both control and water stress conditions over two consecutive years. Phenotypic data were collected for 25 traits, including vegetative growth, flowering, fruit production, and physiological traits, providing a comprehensive assessment of drought response. Genome-wide association studies (GWAS) identified 15 significant genomic regions associated with drought response across eight chromosomes, highlighting key loci related to growth, earliness, fruit set, and physiological traits such as stomatal conductance and proline accumulation. Transgressive lines, such as S5_T_600 and S5_T_601, which exhibit enhanced drought resilience compared to the parental lines, were identified through genomic assisted selection, highlighting their potential as valuable breeding materials. The study emphasizes the importance of the ToMAGIC population in uncovering the polygenic nature of drought response. These findings offer valuable insights for developing drought-resilient tomato cultivars supporting agricultural sustainability in water-limited environments.

19
Reference genome assembly of the tetraploid tuber crop Tropaeolum tuberosum

Scheffler, I.; Greb, T.; Hunziker, P.

2026-02-17 plant biology 10.64898/2026.02.14.705015 medRxiv
Top 0.1%
14.2%
Show abstract

Tropaeolum tuberosum is a tetraploid tuber-forming crop with agroecological and agronomic potential, yet genomic resources for this species remain scarce and limit genetic and functional studies. To address this gap, we generated a reference genome assembly for T. tuberosum using PacBio HiFi reads with an estimated genome size of 418 Mb based on k-mer analysis. The final assembly spans 1.32 Gb with 2,189 contigs (contigs N50 = 32.2 Mb, longest contig = 60 Mb) and recovers 79 % of the estimated genome size. We assessed assembly completeness and accuracy using Benchmarking Universal Single-Copy Orthologs (BUSCO), which detected 98.8 % complete genes (21.6 % single-copy, 77.1 % duplicated), 0.5 % fragmented, and 0.7 % missing, demonstrating near-complete gene space recovery consistent with a high-quality tetraploid reference genome. Repetitive sequences account for 71.6 % of the genome, and we annotated 87,927 protein-coding genes using Helixer. This reference genome assembly represents the first genome-scale resource for T. tuberosum and will enable studies of evolution, domestication and comparative genomics, and support breeding, conservation, and functional genomics in this species and related taxa.

20
Rice Annotation Project Database (RAP-DB): literature-curated gene annotation and integrated omics resources for rice functional genomics and molecular breeding

Kawahara, Y.; Kishikawa, T. H.; Hirata, R.; Wang, X.; Tamagaki, Y.; Kumagai, M.; Tabei, N.; Sakai, H.; Itoh, T.

2026-01-21 bioinformatics 10.64898/2026.01.16.699882 medRxiv
Top 0.1%
14.0%
Show abstract

High-throughput sequencing technologies have enabled the generation of high-quality reference genomes for numerous rice cultivars. However, inferring gene functions, associated phenotypes, and causal variants from these sequences remains challenging. The Rice Annotation Project Database (RAP-DB; https://rapdb.dna.affrc.go.jp) is a curated genomic resource that provides comprehensive gene annotations for the reference genome of Oryza sativa ssp. japonica cv. Nipponbare. Since its major update in 2013, gene models and functional annotations have been continuously revised through expert manual curation of newly published literature related to rice genes. As of March 2025, a total of 6,631 transcripts corresponding to 6,371 loci have been curated based on 4,699 peer-reviewed publications. These curated genes are functionally characterized and are frequently associated with agronomic traits, including yield components, stress tolerance, and disease resistance. To support molecular breeding, RAP-DB now provides a curated catalogue of 904 agronomically important loci, including gene symbols, functional descriptions, and associated traits, together with more than 1,000 functionally characterized alleles compiled from the literature. In addition to in-house expert curation, RAP-DB integrates community-curated datasets for major gene families, such as WRKY transcription factors, S-domain receptor-like kinases, and leucine-rich repeat-containing receptors, thereby expanding coverage of key regulatory and defense-related genes. RAP-DB also incorporates reanalyzed RNA sequencing expression profiles alongside microarray-based expression data and co-expression networks, offering gene-centric views of expression patterns across tissues, conditions, and developmental stages. Furthermore, RAP-DB is linked to genome-wide variation datasets from diverse rice varieties through the TASUKE+ genome browser, enabling exploration of allelic diversity across varieties. To enhance annotation quality and long-term sustainability, AI-assisted literature screening and a web-based feedback system have been introduced, allowing users to submit corrections to gene models and report newly characterized genes or relevant publications. Together, these developments strengthen RAP-DB as a primary, literature-based gene annotation resource and provide a practical foundation for molecular breeding in rice.